Inferring Generative Model Structure with Static Analysis

نویسندگان

  • Paroma Varma
  • Bryan D. He
  • Payal Bajaj
  • Nishith Khandwala
  • Imon Banerjee
  • Daniel L. Rubin
  • Christopher Ré
چکیده

Obtaining enough labeled data to robustly train complex discriminative models is a major bottleneck in the machine learning pipeline. A popular solution is combining multiple sources of weak supervision using generative models. The structure of these models affects training label quality, but is difficult to learn without any ground truth labels. We instead rely on these weak supervision sources having some structure by virtue of being encoded programmatically. We present Coral, a paradigm that infers generative model structure by statically analyzing the code for these heuristics, thus reducing the data required to learn structure significantly. We prove that Coral's sample complexity scales quasilinearly with the number of heuristics and number of relations found, improving over the standard sample complexity, which is exponential in n for identifying nth degree relations. Experimentally, Coral matches or outperforms traditional structure learning approaches by up to 3.81 F1 points. Using Coral to model dependencies instead of assuming independence results in better performance than a fully supervised model by 3.07 accuracy points when heuristics are used to label radiology data without ground truth labels.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian approach to inference of population structure

Methods of inferring the population structure‎, ‎its applications in identifying disease models as well as foresighting the physical and mental situation of human beings have been finding ever-increasing importance‎. ‎In this article‎, ‎first‎, ‎motivation and significance of studying the problem of population structure is explained‎. ‎In the next section‎, ‎the applications of inference of p...

متن کامل

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

Inferring Block Structure of Graphical Models in Exponential Families

Learning the structure of a graphical model is a fundamental problem and it is used extensively to infer the relationship between random variables. In many real world applications, we usually have some prior knowledge about the underlying graph structure, such as degree distribution and block structure. In this paper, we propose a novel generative model for describing the block structure in gen...

متن کامل

A Simple Approach to Static Analysis of Tall Buildings with a Combined Tube-in-tube and Outrigger-belt Truss System Subjected to Lateral Loading

In this paper, an efficient technique is presented for static analysis of tall buildings with combined tube-in-tube and outrigger-belt truss system while considering shear lag effects. In the process of replacing the discrete structure with an elastically equivalent continuous one, the structure is modeled as two parallel cantilevered flexural-shear beams that are constrained at the outrigger-b...

متن کامل

Static and Dynamic Analysis of Bus Structure and Chassis of O-457

With due attention to the fact that the local and foreign vehicle industries are changing and modifying the previous designs in order to produce new designs, the components of self-propelled are to be differently analyzed. Static and dynamic analysis is one of them. In this paper, chassis and body of a o-457 bus were studied and analyzed under finite element method (using ANSYS).This process wa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Advances in neural information processing systems

دوره 30  شماره 

صفحات  -

تاریخ انتشار 2017